Lexicalized Stochastic Modeling of Constraint - Based Grammarsusing Log - Linear Measures
نویسندگان
چکیده
We present a new approach to stochastic modeling of constraint-based grammars that is based on log-linear models and uses EM for estimation from unannotated data. The techniques are applied to an LFG grammar for German. Evaluation on an exact match task yields 86% precision for an ambiguity rate of 5.4, and 90% precision on a subcat frame match for an ambiguity rate of 25. Experimental comparison to training from a parsebank shows a 10% gain from EM training. Also, a new class-based grammar lexicalization is presented, showing a 10% gain over unlexicalized models.
منابع مشابه
Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training
We present a new approach to stochastic modeling of constraintbased grammars that is based on loglinear models and uses EM for estimation from unannotated data. The techniques are applied to an LFG grammar for German. Evaluation on an exact match task yields 86% precision for an ambiguity rate of 5.4, and 90% precision on a subcat frame match for an ambiguity rate of 25. Experimental comparison...
متن کاملDeriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing
A number of recent publications have made use of the incremental output of stochastic parsers to derive measures of high utility for psycholinguistic modeling, following the work of Hale (2001; 2003; 2006). In this paper, we present novel methods for calculating separate lexical and syntactic surprisal measures from a single incremental parser using a lexicalized PCFG. We also present an approx...
متن کاملStochastic human fatigue modeling in production systems
The performance of human resources is affected by various factors such as mental and physical fatigue, skill, and available time in the production systems. Generally, these mentioned factors have effects on human reliability and consequently change the reliability of production systems. Fatigue is a stochastic factor that changes according to other factors such as environmental conditions, work...
متن کاملA Risk-averse Inventory-based Supply Chain Protection Problem with Adapted Stochastic Measures under Intentional Facility Disruptions: Decomposition and Hybrid Algorithms
Owing to rising intentional events, supply chain disruptions have been considered by setting up a game between two players, namely, a designer and an interdictor contesting on minimizing and maximizing total cost, respectively. The previous studies have found the equilibrium solution by taking transportation, penalty and restoration cost into account. To contribute further, we examine how incor...
متن کاملA Novel Reordering Model for Statistical Machine Translation
Word reordering is one of the fundamental problems of machine translation, and an important factor of its quality and efficiency. In this paper, we introduce a novel reordering model based on an innovative structure, named, phrasal dependency tree including syntactical and statistical information in context of a log-linear model. The phrasal dependency tree is a new modern syntactic structure b...
متن کامل